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SEQUENCE LISTING PART OF THE DESCRIPTION 

SEQ. ID. NO. 1 - Wild type gagpol sequence for strain HXB2 (accession no. K03455) 

ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG GGAGAATTAG ATCGATGGGA AAAAATTCGG 6 0 
TTAAGG C C AG GGGGAAAGAA AAAATATAAA TTAAAACATA TAGTATGGGC AAGCAGGGAG 12 0 
CTAGAACGAT TCGCAGTTAA TCCTGGCCTG TTAGAAACAT CAGAAGGCTG TAGACAAATA 18 0 
CTGGGACAGC TACAACCATC CCTTCAGACA GGATCAGAAG AACTTAGATC ATTATATAAT 24 0 
ACAGTAG CAA CCCTCTATTG TGTGCATCAA AGGATAGAGA TAAAAGACAC CAAGGAAGCT 3 00 
TTAGACAAGA TAGAGGAAGA GCAAAACAAA AGTAAGAAAA AAGCACAGCA AGCAGCAGCT 3 60 
GACACAGGAC ACAGCAATCA GGTCAGCCAA AATTACCCTA TAGTGCAGAA CATCCAGGGG 42 0 
CAAATGGTAC ATC AGGC CAT ATC AC CTAGA ACTTTAAATG CATGGGTAAA AGTAGTAGAA 48 0 
GAGAAGGCTT TCAGCCCAGA AGTGATACCC ATGTTTTCAG CATTATCAGA AGGAGCCACC 54 0 
CCACAAGATT TAAAC AC CAT GCTAAACACA GTGGGGGGAC ATCAAGCAGC CATGCAAATG 6 00 
TTAAAAGAGA CCATCAATGA GGAAGCTGCA GAATGGGATA GAGTGCATCC AGTGCATGCA 660 
GGGCCTATTG CACCAGGCCA GATGAGAGAA CCAAGGGGAA GTGACATAGC AGGAACTACT 72 0 
AGTACCCTTC AGGAACAAAT AGGATGGATG ACAAATAATC CACCTATCCC AGTAGGAGAA 78 0 
ATTTATAAAA GATGGATAAT CCTGGGATTA AATAAAATAG TAAGAATGTA TAGCCCTACC 84 0 
AGCATTCTGG ACATAAGACA AGGACCAAAG GAACCCTTTA GAGACTATGT AGACCGGTTC 90 0 
TATAAAACTC TAAGAGC CGA GCAAGCTTCA CAGGAGGTAA AAAATTGGAT GACAGAAACC 96 0 
TTGTTGGTCC AAAATGCGAA CCCAGATTGT AAGACTATTT TAAAAGCATT GGGACC AG CG 102 0 
GCTACACTAG AAGAAATGAT GACAGCATGT CAGGGAGTAG GAGGACCCGG CCATAAGGCA 10 8 0 
AGAGTTTTGG CTGAAGCAAT GAGCCAAGTA ACAAATTCAG CTACCATAAT GATGCAGAGA 114 0 
GGCAATTTTA GGAACCAAAG AAAGATTGTT AAGTGTTTCA ATTGTGGCAA AGAAGGGCAC 1200 
ACAGC CAGAA, ATTGCAGGGC CCCTAGGAAA AAGGGCTGTT GGAAATGTGG AAAGGAAGGA 12 6 0 
CACCAAATGA AAGATTGTAC TGAGAGACAG GCTAATTTTT TAGGGAAGAT CTGGCCTTCC 13 2 0 
TACAAGGGAA GGCCAGGGAA TTTTCTTCAG AGCAGACCAG AGCCAACAGC CCCACCAGAA 13 8 0 
GAGAGCTTCA GGTCTGGGGT AGAGACAACA ACTCCCCCTC AGAAGCAGGA GCCGATAGAC 14 4 0 
AAGGAACTGT ATCCTTTAAC TTCCCTCAGG TC ACT CTTTG GCAACGACCC CTCGTCACAA 15 0 0 
TAAAGATAGG GGGGCAACTA AAGGAAGCTC TATTAGATAC AGGAGCAGAT GATACAGTAT 156 0 
TAGAAGAAAT GAGTTTG CCA GGAAGATGGA AACCAAAAAT GATAGGGGGA ATTGGAGGTT 162 0 
TTATCAAAGT AAG AC AG TAT GATCAGATAC TCATAGAAAT CTGTGGACAT AAAG CTATAG 16 8 0 
GTACAGTATT AGTAGGACCT ACACCTGTCA ACATAATTGG AAGAAATCTG TTGACTCAGA 174 0 
TTGGTTGCAC TTTAAATTTT CCCATTAGCC CTATTGAGAC TGTACCAGTA AAATTAAAGC 18 0 0 
CAGGAATGGA TGGCCCAAAA GTTAAACAAT GGCCATTGAC AGAAGAAAAA ATAAAAGCAT 186 0 
TAGTAGAAAT TTGTACAGAG ATGGAAAAGG AAGGGAAAAT TTCAAAAATT GGGCCTGAAA 192 0 
ATCCATACAA TACTCCAGTA TTTGCCATAA AGAAAAAAGA CAGTACTAAA TGGAGAAAAT 198 0 
TAGTAGATTT CAGAGAACTT AATAAGAGAA CTCAAGACTT CTGGGAAGTT CAATTAGGAA 2 04 0 
TACCACATCC CGCAGGGTTA AAAAAGAAAA AATCAGTAAC AGTACTGGAT GTGGGTGATG 210 0 
CATATTTTTC AGTTCCCTTA GATGAAGACT TCAGGAAGTA TACTGCATTT ACCATACCTA 216 0 
GTATAAACAA TGAGACACCA GGGATTAGAT ATCAG TACAA TGTGCTTCCA CAGGGATGGA 222 0 
AAGGATCACC AGCAATATTC CAAAGT AG C A TGACAAAAAT CTTAGAGCCT TTTAGAAAAC 22 8 0 
AAAATC C AG A CATAGTTATC TATCAATACA TGGATGATTT GTATGTAGGA TCTGACTTAG 234 0 
AAATAGGGCA GCATAGAACA AAAATAGAGG AG CTGAGAC A ACATCTGTTG AGGTGGGGAC 2 4 00 
TTACCACACC AGACAAAAAA CATCAGAAAG AACCTCCATT CCTTTGGATG GGTTATGAAC 246 0 
TCCATCCTGA TAAATGGACA GTACAGCCTA TAGTGCTGCC AGAAAAAGAC AGCTGGACTG 2 52 0 
TCAATGACAT ACAGAAGTTA GTGGGGAAAT TGAATTGGGC AAGTCAGATT TACCCAGGGA 2 58 0 
TTAAAGTAAG GCAATTATGT AAACTCCTTA GAGGAACCAA AGCACTAACA GAAGTAATAC 264 0 
CACTAACAGA AG AAG CAG AG CTAGAACTGG CAGAAAACAG AGAGATTCTA AAAGAACCAG 2 700 
TACATGGAGT GTATTATGAC CCATCAAAAG ACTTAATAGC AGAAATACAG AAGCAGGGGC 2 76 0 
AAGGC CAATG GACATATCAA ATTTATCAAG AGCCATTTAA AAATCTGAAA ACAGGAAAAT 2 820 
ATGCAAGAAT GAGGGGTGCC CACACTAATG ATGTAAAACA ATTAACAGAG GCAGTGCAAA 2 880 
AAATAACCAC AGAAAGCATA GTAATATGGG GAAAGACTCC TAAATTTAAA CTGCCCATAC 2 94 0 
AAAAGGAAAC ATGGGAAACA TGGTGGACAG AGTATTGGCA AGCCACCTGG ATTCCTGAGT 3 000 
GGGAGTTTGT TAATACCCCT CCCTTAGTGA AATTATGGTA CCAGTTAGAG AAAG AAC C C A 3 060 
TAGTAGGAGC AGAAACCTTC TATGTAGATG GGGCAGCTAA CAGGGAGACT AAATTAGGAA 3120 
AAGCAGGATA TGTTACTAAT AGAGGAAGAC AAAAAGTTGT CACCCTAACT GACACAACAA 318 0 
ATCAGAAGAC TGAGTTACAA GCAATTTATC TAGCTTTGCA GGATTCGGGA TTAGAAGTAA 3 24 0 
ACATAGTAAC AGACTCACAA TATGCATTAG GAATCATTCA AGCACAACCA GATCAAAGTG 3 3 00 
AATCAGAGTT AGTCAATCAA AT AAT AG AG C AGTTAATAAA AAAGGAAAAG GTCTATCTGG 33 6 0 
CATGGGTACC AG C ACACAAA GGAATTGGAG GAAATGAACA AGTAGATAAA TTAGTCAGTG 3420 
CTGGAAT CAG GAAAGTACTA TTTTTAGATG GAATAGATAA GGCCCAAGAT GAACATGAGA 348 0 
AATATCACAG TAATTGGAGA GCAATGGCTA GTGATTTTAA CCTGCCACCT GTAGTAGCAA 3 54 0 
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AAGAAATAGT 
TAGACTGTAG 
TGGTAGCAGT 
GGCAGGAAAC 
ATACTGACAA 
GAATCAAGCA 
TGAATAAAGA 
CAGCAGTACA 
ACAGTGCAGG 
AAAAACAAAT 
TTTGGAAAGG 
ATAGTGACAT 
AGATGGCAGG 
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AGCCAGCTGT 
TCCAGGAATA 
TCATGTAGCC 
AG CAT ATTTT 
TGGCAGCAAT 
GGAATTTGGA 
ATTAAAGAAA 
AATGGCAGTA 
GGAAAGAATA 
TACAAAAATT 
ACCAGCAAAG 
AAAAGTAGTG 
TGATGATTGT 



GATAAATGTC 
TGGCAACTAG 
AGTGGATATA 
CTTTTAAAAT 
TTCACCGGTG 
ATTCCCTACA 
ATTATAGGAC 
TTCATCCACA 
G TAG AC AT AA 
CAAAATTTTC 
CTCCTCTGGA 
CCAAGAAGAA 
GTGGCAAGTA 



-2- 

AGCTAAAAGG 
ATTGTACACA 
TAGAAGCAGA 
TAG C AGG AAG 
CTACGGTTAG 
ATCCCCAAAG 
AGGTAAGAGA 
ATTTTAAAAG 
TAGCAACAGA 
GGGTTTATTA 
AAGGTGAAGG 
AAGCAAAGAT 
GACAGGATGA 



AG AAG C CAT G 
TTTAGAAGGA 
AGTTATTCCA 
ATGGCCAGTA 
GGCCGCCTGT 
TCAAGGAGTA 
TCAGGCTGAA 
AAAAGGGGGG 
CATACAAACT 
CAGGGACAGC 
GGCAGTAGTA 
CATTAGGGAT 
GGATTAG 



CATGGACAAG 
AAAGTTATCC 
GCAGAAACAG 
AAAACAATAC 
TGGTGGGCGG 
GTAGAATCTA 
CATCTTAAGA 
ATTGGGGGGT 
AAAGAATTAC 
AGAAATTCAC 
ATACAAGATA 
TATGGAAAAC 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4307 



SEQ l.D. NO. 2 - gagpol-SYNgp - codon optimised gagpol sequence 

ATGGGCGCCC GCGCCAGCGT GCTGTCGGGC GGCGAGCTGG ACCGCTGGGA GAAGATCCGC 6 0 

CTGCGCCCCG GCGGCAAAAA GAAGTACAAG CTGAAGCACA TCGTGTGGGC CAGCCGCGAA 12 0 

CTGGAG CGCT TCGCCGTGAA CCCCGGGCTC CTGGAGACCA GCGAGGGGTG CCGCCAGATC 180 

CTCGGCCAAC TGCAGCCCAG CCTGCAAACC GGCAGCGAGG AGCTGCGCAG CCTGTACAAC 24 0 

ACCGTGGCCA CGCTGTACTG CGTCCACCAG CGCATCGAAA TCAAGGATAC GAAAGAGGCC 3 00 

CTGGATAAAA TCGAAGAGGA ACAGAATAAG AGCAAAAAGA AGGCCCAACA GGCCGCCGCG 360 

GACACCGGAC ACAGCAACCA GGTCAGCCAG AACTACCCCA TCGTGCAGAA CAT CCAGGGG 420 

CAGATGGTGC AC C AGGC CAT CTCCCCCCGC ACGCTGAACG CCTGGGTGAA GGTGGTGGAA 480 

GAGAAGGCTT TTAGCCCGGA GGTGATACCC ATGTTCTCAG CCCTGTCAGA GGGAGC C AC C 54 0 

CCCCAAGATC TGAACACCAT G CTCAAC AC A GTGGGGGGAC ACCAGGCCGC CATGC AG AT G 6 00 

CTGAAGGAGA CCATCAATGA GGAGGCTGCC GAATGGGATC GTGTGCATCC GGTGCACGCA 660 

GGGCCCATCG CACCGGGCCA G ATG CGTGAG CCACGGGGCT CAGACATCGC CGGAACG AC T 720 

AGTACCCTTC AGGAACAGAT CGGCTGGATG ACCAACAACC CACCCATCCC GGTGGGAGAA 780 

ATCTACAAAC GCTGGATCAT CCTGGGCCTG AACAAGATCG TGCGCATGTA TAGCCCTACC 84 0 

AGCATCCTGG ACATCCGCCA AGGCCCGAAG GAACCCTTTC GCGACTACGT GGACCGGTTC 900 

TACAAAACGC TCCGCGCCGA GCAGGCTAGC CAGGAGGTGA AGAACTGGAT GACCGAAACC 960 

CTGCTGGTCC AGAACGCGAA CCCGGACTGC AAGACGATCC TGAAGGCCCT GGGCCCAGCG 1020 

GCTACCCTAG AGGAAATGAT GACCGCCTGT CAGGGAGTGG GCGGACCCGG CCACAAGGCA 108 0 

CGCGTCCTGG CTGAGGCCAT GAGCCAGGTG ACCAACTCCG CTAC CAT CAT GATGCAGCGC 114 0 

GGCAACTTTC GGAACCAACG CAAGATCGTC AAGTGCTTCA ~ ACTGTGGCAA AGAAGGGCAC 1200 

ACAGCCCGCA AC TG CAGGGC CCCTAGGAAA AAGGGCTGCT GGAAATGCGG CAAGGAAGGC 126 0 

CACCAGATGA AAGACTGTAC TGAGAGACAG GCTAATTTTT TAGGGAAGAT CTGGCCTTCC 13 2 0 

TACAAGGGAA GGCCAGGGAA TTTTCTTCAG AGCAGACCAG AGCCAACAGC CCCACCAGAA 13 8 0 

GAGAGCTTCA GGTCTGGGGT AGAGACAACA ACTCCCCCTC AGAAGCAGGA GC CGATAG AC 1440 

AAGGAACTGT AT CCTTTAAC TTCCCTCAGA TCACTCTTTG GCAACGACCC CTCGTCACAA 15 0 0 

TAAAGATAGG GGGGCAGCTC AAGGAGGCTC T C CTGG AC AC CGGAGCAGAC GACACCGTGC 15 6 0 

TGGAGGAGAT GTCGTTGCCA GGCCGCTGGA AGC CGAAGAT GATCGGGGGA ATCGGCGGTT 162 0 

TCATCAAGGT GCGCCAGTAT GACCAGATCC T C ATCGAAAT CTGCGGCCAC AAGGCTATCG 16 80 

GTACCGTGCT GGTGGGCCCC ACACCCGTCA ACATCATCGG ACGCAACCTG TTGACGCAGA 174 0 

TCGGTTGCAC GCTGAACTTC CCCATTAGCC CTATCGAGAC GGTACCGGTG AAG CTGAAG C 18 0 0 

CCGGGATGGA CGGCCCGAAG GTCAAGCAAT GGCCATTGAC AGAGGAGAAG ATCAAGGCAC 18 6 0 

TGGTGGAGAT TTG C AC AG AG ATGGAAAAGG AAGGGAAAAT CTCCAAGATT GGGCCTGAGA 192 0 

ACCCGTACAA CACGCCGGTG TTCGCAATCA AGAAGAAGGA CTCGACGAAA TGGCGCAAGC 19 8 0 

TGGTGGACTT CCGCGAGCTG AACAAGCGCA . CGCAAGACTT CTGGGAGGTT CAGCTGGGCA 2 04 0 

TCCCGCACCC CGCAGGGCTG AAGAAGAAGA AATCCGTGAC CGTACTGGAT GTGGGTGATG 210 0 

CCTACTTCTC CGTTCCCCTG GACGAAGACT TCAGGAAGTA CACTGCCTTC ACAATCCCTT 216 0 

CGATCAACAA CGAGACACCG GGGATTCGAT ATCAGTACAA CGTGCTGCCC CAGGGCTGGA 222 0 

AAGGCTCTCC CGCAATCTTC CAGAGTAGCA TGACCAAAAT CCTGGAGCCT TTCCGCAAAC 2 2 80 

AGAACCCCGA CATCGTCATC TATCAGTACA TGGATGACTT GTACGTGGGC TCTGATCTAG 2 34 0 

AGATAGGGCA GCACCGCACC AAG AT CG AGG AGCTGCGCCA GCACCTGTTG AGGTGGGGAC 2 40 0 

TGACCACACC CGACAAGAAG CACCAGAAGG AGCCTCCCTT CCTCTGGATG GGTTACGAGC 246 0 

TGCACCCTGA CAAATGGACC GTGCAGCCTA TCGTGCTGCC AGAGAAAGAC AGCTGGACTG 2 52 0 
TCAACGACAT ACAGAAGCTG GTGGGGAAGT TGAACTGGGC CAGTCAGATT TACCCAGGGA 2 5 80 
TTAAGGTGAG GCAGCTGTGC AAACTCCTCC GCGGAACCAA GGCACTCACA GAGGTG AT CC 2 64 0 
CCCTAACCGA GGAGGCCGAG CTCGAACTGG C AGAAAAC CG AGAGATCCTA AAGGAGCCCG 2 7 00 
TGCACGGCGT GTACTATGAC CCCTCCAAGG ACCTGATCGC CGAGATCCAG AAGCAGGGGC 2 76 0 
AAGGCCAGTG G AC CTATC AG ATTTACCAGG AGCCCTTCAA G AAC CTGAAG ACCGGCAAGT 2 82 0 
ACGCCCGGAT GAGGGGTGCC CACACTAACG ACGTCAAGCA GCTGACCGAG GCCGTGCAGA 2 8 80 
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AGATCACCAC 
AGAAGGAAAC 
GGGAGTTCGT 
TAGTGGGCGC 
AAGCCGGATA 
AC C AG AAGAC 
ACATCGTGAC 
AGTCCGAGCT 
CCTGGGTACC 
CTGGCATCAG 
AATAC C AC AG 
AAGAGATCGT 
TGGACTGTAG 
TGGTAGCCGT 
GGCAGGAGAC 
ATACTGACAA 
GAAT CAAGCA 
TGAATAAGGA 
CCGCGGTCCA 
ACAGTGCGGG 
AAAAG C AG AT 
TCTGGAAAGG 
ATAGCGACAT 
AGATGGCGGG 



CGAAAGCATC 

CTGGGAAACC 

CAACACCCCT 

CGAAACCTTC 

CGTCACTAAC 

TGAGCTGCAG 

AGACTCTCAG ' 

GGTCAATCAG 

CGCCCACAAA 

GAAGGTGCTA 

CAACTGGCGG 

GGCCAG CTGT 

CCCCGGCATC 

CCATGTGGCC 

AG CCT ACTTC 

TGGCAGCAAT 

GGAGTTCGGG 

GTTAAAGAAG 

AATGGCGGTA 

GGAGCGGATC 

TACCAAGATT 

CCCAGCGAAG 

CAAGGTGGTG 

TGATGATTGC 



GTGATCTGGG 
TGGTGGACAG 
CCCCTGGTGA 
TACGTGGATG 
CGGGGCAGAC 
GCCATTTACC 
TATGCCCTGG 
ATCATCGAGC 
GGCATTGGCG 
TTCCTGGATG 
GCCATGGCTA 
GACAAGTGTC 
TGGCAACTCG 
AGTGGCTACA 
CTCCTGAAGC 
TTCACCAGTG 
ATCCCCTACA 
ATTATCGGCC 
TTCATCCACA 
GTGGAC AT CA 
CAGAATTTCC 
CTCCTCTGGA 
CCCAGAAGAA 
GTGGCGAGCA 



GAAAGACTCC 
AGTATTGGCA 
AGCTGTGGTA 
GGGCCGCTAA 
AGAAGGTTGT 
TCGCTTTGCA 
GCATCATTCA 
AGCTGATCAA 
GCAATGAGCA 
GCATCGACAA 
GCGACTTCAA 
AG CTC AAGGG 
ATTGCACCCA 
TCGAGGCCGA 
TGGCAGGCCG 
CTACGGTTAA 
ATC CCCAGAG 
AGGTCAGAGA 
ATTTCAAGCG 
TCGCGACCGA 
GGGTCTACTA 
AGGGTGAGGG 
AGGCGAAGAT 
GACAGGATGA 



TAAGTTCAAG 
GGCCACCTGG 
CCAGCTGGAG 
CAGGGAGACT 
CACCCTCACT 
GGACTCGGGC 
AGCCCAGCCA 
GAAGGAAAAG 
GGTCGACAAG 
GGCCCAGGAC 
CCTGCCCCCT 
CGAAGCCATG 
TCTGGAGGGC 
GGTCATTCCC 
GTGGCCAGTG 
GGCCGCCTGC 
TCAGGGCGTC 
TCAGGCTGAG 
GAAGGGGGGG 
CATCCAGACT 
CAGGGACAGC 
GGCAGTAGTG 
CATTAGGGAT 
GGATTAG 



CTGCCCATCC 
ATTCCTGAGT 
AAGGAG C CCA 
AAGCTGGGCA 
GACACCACCA 
CTGGAGGTGA 
GACCAGAGTG 
GTCTATCTGG 
CTGGTCTCGG 
GAGCACGAGA 
GTGGTGGCCA 
CATGGCCAGG 
AAGGTTATCC 
GCCGAAACAG 
AAGACCATCC 
TGGTGGGCGG 
GTCGAGTCTA 
CATCTCAAGA 
ATTGGGGGGT 
AAGGAGCTGC 
AGAAATCCCC 
ATCCAGGATA 
TATGGCAAAC 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4307 



SEQ. ID. NO. 3 - Envelope Gene from HIV-1 MN (Genbank accession no. Ml 7449) 



ATGAGAGTGA 
CTTGGGTTAT 
GTACCTGTGT 
GATACAGAGG 
CAAGAAGTAG 
GAACAGATGC 
TTAACCCCAC 
AATAGTACTG 
AACTGCTCTT 
CTTTATAAAC 
TGTAATACCT 
CACTATTGTG 
AAAGGATCAT 
TCAACTCAAC 
AATTTCACTG 
TGTACAAGAC 
TATACAACAA 
AAATGGAATG 
ACAATAGTCT 
TGTGGAGGGG 
AATAATACTT 
AAACAAATTA 
GGACAAATTA 
GACACGGACA 
TGGAGAAGTG 
ACCAAGGCAA 
CTTGGGTTCT 
C AGG C C AG AC 
GAGGCGCAAC 
GTCCTGGCTG 
GGAAAACTCA 
GATGATATTT 
AGCTTAATAT 
TTATTGGAAT 
TGGTATATAA 
GCTGTACTTT 
CGCCCCCCAG 



AGGGGATCAG 
TAATG AT CTG 
GGAAAGAAGC 
TACATAATGT 
AATTGGTAAA 
ATGAGGATAT 
TCTGTGTTAC 
CTAATAACAA 
TCAATATCAC 
TTGATATAGT 
CAGTCATTAC 
CCCCGGCTGG 
GTAAAAATGT 
TGCTGTTAAA 
ATAATGCTAA 
CCAACTACAA 
AAAATATAAT 
ACACTTTAAG 
TTAATCAATC 
AATTTTTCTA 
GGAATAATAC 
TAAACATGTG 
GATGTTCATC 
CGAACGACAC 
AATTATATAA 
AGAGAAGAGT 
TAGGAGCAGC 
TATTATTGTC 
AG C AT ATGTT 
TGGAAAGATA 
TTTGCACCAC 
GGAATAACAT 
ACTCATTACT 
TGGATAAATG 
AAAT ATT CAT 
CTATAGTGAA 
TTCCGAGGGG 



GAGGAATTAT 
TAGTGCTACA 
AACCACCACT 
TTGGGC CACA 
TGTGACAGAA 
AATCAGTTTA 
TTTAAATTGC 
TAGTAATAGC 
CACAAGCATA 
ATCAATAGAT 
ACAAGCTTGT 
TTTTGCGATT 
CAGCACAGTA 
TGGCAGTCTA 
AACCATCATA 
TAAAAGAAAA 
AGGAACTATA 
ACAGATAGTT 
CTCAGGAGGG 
CTGTAATACA 
TACAGGGTCA 
GCAGGAAGTA 
AAATATTACA 
CGAGATCTTC 
ATATAAAGTA 
GGTGCAGAGA 
AGGAAGCACT 
TGGTATAGTG 
GCAACTCACA 
CCTAAAGGAT 
TACTGTGCCT 
GACCTGGATG 
AG AAAAATC G 
GGCAAGTTTG 
AATGATAGTA 
TAGAGTTAGG 
ACCCGACAGG 



CAGCACTGGT 

GAAAAATTGT 

CTATTTTGTG 

CAAGCCTGTG 

AATTTTAACA 

TGGGATCAAA 

ACTGATTTGA 

GAGGGAACAA 

AGAGATAAGA 

AATGATAGTA 

CCAAAGATAT 

CTAAAATGTA 

CAATGTACAC 

GCAGAAGAAG 

GTACATCTGA 

AGGATACATA 

AGACAAGCAC 

AG C AAAT T AA 

GACCCAGAAA 

TCACCACTGT 

AATAACAATA 

GG AAAAG CAA 

GGGCTACTAT 

AGACCTGGAG 

GTAACAATTG 

GAAAAAAGAG 

ATGGGCGCAG 

CAACAGCAGA 

GTCTGGGGCA 

CAACAGCTCC 

TGGAATGCTA 

CAGTGGGAAA 

CAAACCCAAC 

TGG AATTGGT ■ 

GGAGGCTTGG 

CAGGGATACT 

CCCGAAGGAA 



GGGGATGGGG 

GGGTCACAGT 

CATCAGATGC 

TACCCACAGA 

TGTGGAAAAA 

GCCTAAAGCC 

GGAATACTAC 

TAAAGGGAGG 

TGCAGAAAGA 

CCAG CTAT AG 

CCTTTGAGCC 

ACGATAAAAA 

ATGGAATTAG 

AGGT AG TAAT 

ATGAAT CTGT 

TAGGACCAGG 

ATTGTAACAT 

AAGAACAATT 

TTGTAATGCA 

TTAATAGTAC 

TCACACTTCA 

TGTATGCCCC 

TAACAAGAGA 

GAGGAGATAT 

AACCATTAGG 

CAGCGATAGG 

CGTCAGTGAC 

ACAATTTGCT. 

TCAAGCAGCT 

TGGGGTTTTG 

GTTGGAGTAA 

GAGAAATTGA 

AAGAAAAGAA 

TTGACATAAC 

TAGGTTTAAG 

CACCATTGTC 

TCGAAGAAGA 



CACGATGCTC 
CTATTATGGG 
TAAAGCATAT 
CCCCAACCCA 
TAACATGGTA 
ATGTGTAAAA 
TAATACCAAT 
AGAAATGAAA 
ATATGCACTT 
GTTGATAAGT 
AATTCCCATA 
GTTCAGTGGA 
GCCAGTAGTA 
TAGATCTGAG 
ACAAATTAAT 
GAGAGCATTT 
TAGTAGAGCA 
TAAGAATAAA 
CAGTTTTAAT 
TTGGAATGGT 
ATGCAAAATA 
TCCCATTGAA 
TGGTGGTAAG 
GAGGGACAAT 
AGTAGCACCC 
AGCTCTGTTC 
GCTGACGGTA 
GAGGGCCATT 
CCAGGCAAGA 
GGGTTGCTCT 
TAAATCTCTG 
CAATTACACA 
TGAACAAGAA 
AAATTGGCTG 
AATAGTTTTT 
GTTG CAGACC 
AGGTGGAGAG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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AGAGACAGAG ACACATCCGG TCGATTAGTG CATGGATTCT TAGCAATTAT CTGGGTCGAC 22 8 0 

CTGCGGAGCC TGTTCCTCTT CAGCTACCAC CACAGAGACT TACTCTTGAT TGCAGCGAGG 23 4 0 

ATTGTGGAAC TTCTGGGACG CAGGGGGTGG GAAGTCCTCA AATATTGGTG GAATCTCCTA 2 4 00 

CAGTATTGGA GTCAGGAACT AAAGAGTAGT GCTGTTAGCT TGCTTAATGC CACAGCTATA 246 0 

GCAGTAGCTG AGGGGACAGA TAGGGTTATA GAAGTACTGC AAAGAGCTGG TAGAGCTATT 2 52 0 

CTCCACATAC CTACAAGAAT AAGACAGGGC TTGGAAAGGG CTTTGCTATA A 2 5 71 



SEQ. LD. NO. 4 - SYNgp-160mn - codon optimised env sequence 

ATGAGGGTGA AGGGGATCCG CCGCAACTAC CAGCACTGGT GGGG CTGGGG CACGATGCTC 60 
CTGGGGCTGC TGATGATCTG CAGCGCCACC GAGAAGCTGT GGGTGACCGT GTACTACGGC 12 0 
GTGCCCGTGT GGAAGGAGGC CACCACCACC CTGTTCTGCG CCAGCGACGC CAAGGCGTAC 18 0 
GACACCGAGG TGCACAACGT GTGGGCCACC CAGGCGTGCG TGCCCACCGA CCCCAACCCC 24 0 
CAGGAGGTGG AGCTCGTGAA CGTGACCGAG AACTTCAACA TGTGGAAGAA CAACATGGTG 3 00 
GAGCAGATGC ATGAGGACAT CATC AG C CTG TGGGACCAGA GCCTGAAGCC CTGCGTGAAG 360 
CTGACCCCCC TGTGCGTGAC CCTGAACTGC ACCGACCTGA GGAACACCAC CAACACCAAC 42 0 
AACAGC AC CG CCAACAACAA CAGCAACAGC GAGGGCACCA TCAAGGGCGG CGAGATGAAG 4 80 
AACTGCAGCT TCAACATCAC CACCAGCATC CGCGACAAGA TGCAGAAGGA GTACGCCCTG 54 0 
CTGTACAAGC TGGATATCGT GAG CATCGAC AACGACAGCA CCAGCTACCG CCTGATCTCC 6 00 
TGCAACACCA GCGTGATCAC CCAGGCCTGC CCCAAGATCA GCTTCGAGCC CATCCCCATC 66 0 
CACTACTGCG CCCCCGCCGG CTTCGCCATC CTGAAGTGCA ACGACAAGAA GTTCAGCGGC 72 0 
AAGGGCAGCT GCAAGAACGT GAGCACCGTG CAGTGCACCC ACGGCATCCG GCCGGTGGTG 78 0 
AGCACCCAGC TCCTGCTGAA CGGCAGCCTG GCCGAGGAGG AGGTGGTGAT CCGCAGCGAG 84 0 
AACTTCACCG ACAACGCCAA GACCATCATC GTGCACCTGA ATGAGAGCGT GCAGATCAAC 90 0 
TGCACGCGTC CCAACTACAA CAAGCGCAAG CGCATCCACA TCGGCCCCGG GCGCGCCTTC 960 
TACACCACCA AGAACATCAT CGGCACCATC CGCCAGGCCC ACTGCAACAT CTCTAGAGCC 102 0 
AAGTGGAACG ACACCCTGCG CCAGATCGTG AGCAAGCTGA AGGAGCAGTT CAAGAACAAG 108 0 
ACCATCGTGT TCAACCAGAG CAGCGGCGGC GACCCCGAGA TCGTGATGCA C AG CTTCAAC 114 0 
TGCGGCGGCG AATTCTTCTA CTGCAACACC AGCCCCCTGT TCAACAGCAC CTGGAACGGC 12 00 
AACAAC AC CT GGAACAACAC CACCGGCAGC AACAACAATA TTACCCTCCA GTGCAAGATC 12 60 
AAGCAGATCA TCAACATGTG GCAGGAGGTG GGCAAGGCCA TGTACGCCCC CCCCATCGAG 132 0 
GGCCAGATCC GGTGCAGCAG CAACATCACC GGTCTGCTGC TG AC C CG CG A CGGCGGCAAG 13 80 
GACACCGACA CCAACGACAC CGAAATCTTC CGCCCCGGCG GCGGCGACAT GCG CGACAAC 144 0 
TGGAGATCTG AGCTGTACAA GTACAAG GTG GTGACGATCG AGCCCCTGGG CGTGGCCCCC 15 00 
ACCAAGGCCA AGCGCCGCGT GGTGCAGCGC GAGAAG CGGG CCGCCATCGG CGCCCTGTTC 1560 
CTGGGCTTCC TGGGGGCGGC GGGCAGCACC ATGGGGGCCG CCAGCGTGAC CCTGACCGTG 162 0 
CAGGCCCGCC TGCTCCTGAG CGGCATCGTG CAGCAGCAGA ACAACCTCCT CCGCGCCATC 16 8 0 
GAGGCCCAGC AGCATATGCT CCAGCTCACC GTGTGGGGCA TCAAGCAGCT CCAGGCCCGC 174 0 
GTGCTGGCCG TGGAGCGCTA CCTGAAGGAC CAGCAGCTCC TGGGCTTCTG GGGCTGCTCC 18 0 0 
GGCAAGCTGA TCTGCACCAC CACGGTACCC TGGAACG CCT CCTGGAGCAA CAAGAGCCTG 18 6 0 
GACGACATCT GGAACAACAT GACCTGGATG CAGTGGGAGC GCGAGATCGA TAACTACACC 192 0 
AGCCTGATCT ACAGCCTGCT GGAGAAGAGC CAGACCCAGC AGGAGAAGAA CGAGCAGGAG 198 0 
CTGCTGGAGC TGGACAAGTG GGCGAGCCTG TGGAACTGGT TCGACATCAC CAACTGGCTG 204 0 
TGGT AC AT C A AAATCTTCAT CATGATTGTG GGCGGCCTGG TGGGCCTCCG CATCGTGTTC 2100 
GCCGTGCTGA GCATCGTGAA CCGCGTGCGC CAGGGCTACA GCCCCCTGAG CCTCCAGACC 216 0 
CGGCCCCCCG TGCCGCGCGG GCCCGACCGC CCCGAGGGCA TCGAGGAGGA GGGCGGCGAG 222 0 
CGCGACCGCG ACACCAGCGG CAGGCTCGTG CACGGCTTCC TGGCG AT CAT CTGGGTCGAC 228 0 
CTCCGCAGCC TGTTCCTGTT CAGCTACCAC CACCGCGACC TGCTGCTGAT CGCCGCCCGC 234 0 
ATCGTGGAAC TCCTAGGCCG CCGCGGCTGG GAGGTGCTGA AGTACTGGTG GAACCTCCTC 24 00 
CAGTATTGGA GCCAGGAGCT GAAGTCC AG C GCCGTGAGCC TGCTGAACGC CACCGCCATC 246 0 
GCCGTGGCCG AGGGCACCGA CCGCGTGATC GAGGTGCTCC AGAGGGC CGG GAGGGCGATC 2 52 0 
CTGCACATCC CCACCCGCAT CCGCCAGGGG CTCGAGAGGG CGCTGCTGTA A 2 5 71 



SEQ. LD. NO. 1 1 - Complete Sequence of pH4DOZENEGS 

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG CGCAGCGTGA 60 
CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG 12 0 
CCACGTTCGC CGGCTTTCCC CGTCAAGCTC TAAATCGGGG GCTCCCTTTA GGGTTC CG AT 18 0 
TTAGTG CTTT ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT TCACGTAGTG 24 0 
GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG TTCTTTAATA 300 
GTGGACTCTT GTTCCAAACT GGAACAACAC TCAACCCTAT CTCGGTCTAT TCTTTTGATT 360 
TATAAGGGAT TTTGCCGATT TCGGCCTATT GGTTAAAAAA TGAGCTGATT TAACAAAAAT 42 0 
TTAACGCGAA TTTTAACAAA ATATTAACGC TTACAATTTC CATTCGCCAT TCAGGCTGCG 48 0 
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GAACTGTTGG GAAGGGCGAT CGGTGCGGGC CTCTTCGCTA TTACGCCAGC TGGCGAAAGG 54 0 
GGGATGTGCT GCAAGGCGAT TAAGTTGGGT AACGCCAGGG TTTTCCCAGT CACGACGTTG 60 0 
TAAAACGACG GCCAGTGAGC GCGCGTAATA CGACTCACTA TAGGGCGAAT TGGAGCTCCA 66 0 
CCGCGGTGGC GGCCGCTCTA GAGTCCGTTA CATAACTTAC GGTAAATGGC CCGCCTGGCT 72 0 
GACCGCCCAA CGACCCCCGC CCATTGACGT CAATAATGAC GTATGTTCCC ATAGTAACGC 78 0 
CAATAGGGAC TTTCCATTGA CGTCAATGGG TGGAGTATCT ACGGTAAACT GCCCACTTGG 84 0 
C AG TAC AT C A AGTGTATCAT ATGCCAAGTA CGCCCCCTAT TGACGTCAAT GACGGTAAAT 90 0 
GGCCCGCCTG GCATTATGCC CAGTACATGA CCTTATGGGA CTTTCCTACT TGGCAGTACA 96 0 
TCTACGTATT AGTCATCGCT ATTACCATGG TGATGCGGTT TTGG C AG TAC ATCAATGGGC 102 0 
GTGGATAGCG GTTTGACTCA CGGGGATTTC CAAGTCTCCA CCCCATTGAC GTCAATGGGA 108 0 
GTTTGTTTTG GCAC CAAAAT CAACGGGACT TTCCAAAATG TCGTAACAAC TCCGCCCCAT 114 0 
TGACG CAAAT GGGCGGTAGG CGTGTACGGT GGGAGGTCTA TATAAGCAGA GCTCGTTTAG 12 00 
TGAACCGGTC TCTCTGGTTA GACCAGATCT GAGC CTGGGA GCTCTCTGGC TAACTAGGGA 12 6 0 
ACCCACTGCT TAAGC CTCAA TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG TGTGCCCGTC 132 0 
TGTTGTGTGA CTCTGGTAAC TAGAGATCCC TCAGACCCTT TTAGTCAGTG TGGAAAATCT 13 8 0 
CTAGCAGTGG CGCCCGAACA GGGACTTGAA AGCGAAAGGG AAACCAGAGG AGCTCTCTCG 144 0 
AC G C AGGACT CGGCTTGCTG AAGCGCGCAC GGCAAGAGGC GAGGGGCGGC GACTGGTGAG 15 0 0 
TACGCCAAAA ATTTTGACTA GCGGAGGCTA GAAGGAGAGA GATGGGTGCG AG AGCGT CAG 156 0 
TATTAAGCGG GGGAGAATTA GAT CGCGATG GGAAAAAATT CGGTTAAGGC CAGGGGGAAA 162 0 
GAAAAAATAT AAATTAAAAC ATATAGTATG GGCAAGCAGG GAGCTAGAAC GATTCGCAGT 168 0 
TAATCCTGGC CTGTTAGAAA CATCAGAAGG CTGTAGACAA ATACTGGGAC AGCTACAACC 174 0 ' 
ATCCCTTCAG ACAGGATCAG AAGAACTTAG ATCATTATAT AATACAGTAG CAACCCTCTA 18 00 
TTGTGTGCAT CAAAGGTTGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGGA 18 6 0 
GAGCAAAACA AAAGTAAGAA AAAAGCACAG CAAGCAGCAG CTGACACAGG ACACAGCAAT 192 o" 
CAGGTCAGCC AAAATTACCC TATAGTGCAG AACATCCAGG GGCAAATGGT ACATCAGGCC 198 0 
ATATCACCTA GAACTTTAAA TGCATGGGTA AAAGTAGTAG AAGAGAAGGC TTTCAGCCCA 2 04 0 
GAAGTGATAC CCATGTTTTC AGCATTATCA GAAGGAGCCA CCCCACAAGA TTTAAAC AC C 210 0 
ATG CTAAAC A CAGTGGGGGG ACATCAAGCA GCCATGCAAA TGTTAAAAGA GAC CAT CAAT 216 0 
GAGGAAGCTG CAGGAATTCG CCTAAAACTG CTTGTACCAA TTGCTATTGT AAAAAGTGTT 2 22 0 
GCTTTCATTG CCAAGTTTGT TTCATAACAA AAGCCTTAGG CATCTCCTAT GGCAGGAAGA 228 0 
AGCGGAGACA GCGACGAAGA GCTCATCAGA ACAGTCAGAC TCATCAAGCT TCTCTATCAA 2 34 0 
AGCAGTAAGT AGTACATGTA ACGCAACCTA TAC CAATAGT AGCAATAGTA GCATTAGTAG 240 0 
TAGCAATAAT AATAGCAATA GTTGTGTGGT C CAT AGTAAT CATAGAATAT AGGAAAATAT 2 46 0 
TAAGACAAAG AAAAATAGAC AGGTTAATTG ATAGACTAAT AGAAAGAGCA GAAGACAGTG 2 52 0 
GCAATGAGAG TGAAGGAGAA ATATCAGCAC TTGTGGAGAT GGGGGTGGAG ATGGGGCACC 2 58 0 
ATGCTCCTTG GGATGTTGAT GATCTGTAGT GCTACAGAAA AATTGTGGGT CACAGTCTAT 2 64 0 
TATGGGGTAC CTGTGTGGAA GGAAGCAACC ACCACTCTAT TTTGTGCATC AGATGCTAAA 2 70 0 
GCATAGATCT TCAGACTTGG AGGAGGAGAT ATGAGGGACA ATTGGAGAAG TGAATTATAT 2 76 0 
AAATATAAAG TAGTAAAAAT TGAACCATTA GGAGTAGCAC CCACCAAGGC AAAGAGAAGA 2 82 0 
GTGGTGCAGA GAGAAAAAAG AGCAGTGGGA ATAGGAGCTT TGTTCCTTGG GTTCTTGGGA 288 0 
GCAGCAGGAA GCACTATGGG CGCAGCGTCA ATGACGCTGA CGGTACAGGC CAGACAATTA 2 94 0 
TTGTCTGGTA TAGTGCAGCA GCAGAACAAT TTGCTGAGGG CTATTGAGGC GCAAC AG CAT 3 00 0 
CTGTTGCAAC TCACAGTCTG GGGCATCAAG CAGCTCCAGG CAAGAATCCT GGCTGTGGAA 3 060 
AG AT AC CT AA AGGATCAACA GCTCCTGGGG ATTTGGGGTT GCTCTGGAAA ACTCATTTGC 312 0 
ACCACTGCTG TGCCTTGGAA TGCTAGTTGG AGTAATAAAT CTCTGGAACA GATCTGGAAT 318 0 
CACACGACCT GGATGGAGTG GGACAGAGAA ATTAACAATT ACACAAGCTT AATACACTCC 3 24 0 
TTAATTGAAG AATCGCAAAA CCAGCAAGAA AAGAATGAAC AAGAATTATT GGAATTAGAT 3 3 00 
AAATGGGCAA GTTTGTGGAA TTGGTTTAAC ATAACAAATT GGCTGTGGTA TATAAAATTA 3 360 
TTCATAATGA TAGTAGGAGG CTTGGTAGGT TTAAGAATAG TTTTTGCTGT ACTTTCTATA 3 42 0 
GTGAATAGAG TTAGGCAGGG ATATTCACCA TTATCGTTTC AGACCCACCT CCCAACCCCG 34 8 0 
AGGGGACCCG ACAGGCCCGA AGGAATAGAA GAAGAAGGTG GAGAGAGAGA CAGAGACAGA 3 54 0 
TCCATTCGAT TAGTGAACGG ATCCTTGGCA CTTATCTGGG ACGATCTGCG GAGCCTGTGC 3600 
CTCTTCAGCT ACCACCGCTT GAGAGACTTA CTCTTGATTG TAACGAGGAT TGTGGAACTT 3660 
CTGGGACGCA GGGGGTGGGA AGCCCTCAAA TATTGGTGGA ATCTCCTACA GTATTGGAGT 3 72 0 
CAGGAACTAA AGAATAGTGC TGTTAGCTTG CTCAATGCCA CAGCCATAGC AGTAGCTGAG 3 78 0 
GGGACAGATA GGGTTATAGA AGTAGTACAA GGAGCTTGTA GAGCTATTCG CCACATACCT 3 84 0 
AGAAGAATAA GACAGGGCTT GGAAAGGATT TTGCTATAAG ATGGGTGGCA AGTGGTCAAA 3 900 
AAGTAGTGTG ATTGGATGGC CTACTGTAAG GGAAAGAATG AGACGAGCTG AGCCAGCAGC 3960 
AGATAGGGTG GGAGCAGCAT CTCGACGCTG CAGGAGTGGG GAGGCACGAT GGCCGCTTTG 4 02 0 
GTCGAGGCGG ATCCGGCCAT TAGCCATATT ATTCATTGGT TATATAGCAT AAATCAATAT 4 08 0 
TGGCTATTGG CCATTGCATA CGTTGTATCC ATATCATAAT ATGTACATTT ATATTGGCTC 414 0 
ATGTCCAACA TTACCGCCAT GTTGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT 42 0 0 
TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA 426 0 
TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 432 0 
TCCCATAGTA ACGC CAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA 43 8 0 
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AACTGCCCAC TTGGCAGTAC ATCAAGTGTA 
CAATGACGGT AAATGGCCCG CCTGGCATTA 
TACTTGGCAG TACATCTACG TATTAGTCAT 
GTACATCAAT GGGCGTGGAT AGCGGTTTGA 
TGACGTCAAT GGGAGTTTGT TTTGGCACCA 
CAACTCCGCC CCATTGACGC AAATGGGCGG 
CAGAGCTCGT TTAGTGAACC GTCAGATCGC 
CCATAGAAGA CACCGGGACC GATCCAGCCT 
CGGGGATGAC GTCATCGACT TCGAAGGTTC 
ACGTCATCGA CTTCGAAGGT TCGAATCCTT 
GACTTCGAAG GTTCGAATCC TTCTAGATTC 
AGGTT CGAAT CCTTCCAGTT CCACCAGTCG 
ATCCTTCTCT TCCCACCATT TTTTTTC CAC 
GGGCCCACCA GTCGAGGGCT ACGTCATCGA 
CATTTTTTCT GAACGTCATC GACTTCGAAG 
TATAACGTCA TCGACTTCGA AGGTTCGAAT 
CATCGACTTC GAAGGTTCGA ATCCTTCTTC 
TCGAAGGTTC GAATCCTTCG TAGTTCACCA 
TCGAATCCTT CTAGGCCCAC CAGTCGACGC 
GACCTAGAAA AACATGGAGC AATCACAAGT 
GCCTGGCTAG AAGCACAAGA GGAGGAGGAG 
TTAAG AC CAA TGACTTACAA GGCAGCTGTA 
GGAC TGGAAG GGCTAATTCA CTCCCAACGA 
CACACACAAG GCTACTTCCC TGATTGGCAG 
CCACTGACCT TTGGATGGTG CTACAAGCTA 
GCCAATGAAG GAGAGAACAC CCGCTTGTTA 
CCGGAGAGAG AAGTATTAGA GTGGAGGTTT 
CGAGAGCTGC ATCCGGAGTA CTTCAAGAAC 
TCCGCTGGGG ACTTTCCAGG GAGGCGTGGC 
AGATGCTGCA TATAAGCAGC TGCTTTTTGC 
CTGAGCCTGG GAGCTCTCTG GCTAACTAGG 
GCCTTGAGTG CTTCAAGTAG TGTGTGCCCG 
CCTCAGACCC TTTTAGTCAG TGTGGAAAAT 
AGCTTTTGTT CCCTTTAGTG AGGGTTAATT 
TTTCCTGTGT GAAATTGTTA TCCGCTCACA 
AAGTGTAAAG CCTGGGGTGC CTAATGAGTG 
CTGCCCGCTT TCCAGTCGGG AAACCTGTCG 
GCGGGGAGAG GCGGTTTGCG TATTGGGCGC 
CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA 
TCCACAGAAT CAGGGGATAA CGCAGGAAAG 
AGGAACCGTA AAAAGGCCGC GTTGCTGGCG 
CATCACAAAA ATCGACGCTC AAGTCAGAGG 
CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 
GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 
GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 
CACGACTTAT CGCCACTGGC AGCAGCCACT 
GGCGGTGCTA C AGAGTT CTT GAAGTGGTGG 
TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 
TCCGGCAAAC AAACCACCGC TGGTAGCGGT 
CGCAGAAAAA AAGGATCTCA AGAAGATCCT 
TGGAACGAAA ACTCACGTTA AGGGATTTTG 
TAGATCCTTT TAAATTAAAA ATGAAGTTTT 
TGGTCTGACA GTTACCAATG CTTAATCAGT 
CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 
CCATCTGGCC CCAGTGCTGC AATGATACCG 
TCAGCAATAA ACCAGCCAGC CGGAAGGGCC 
GCCTCCATCC AGTCTATTAA TTGTTGCCGG 
AGTTTGCGCA ACGTTGTTGC CATTGCTACA 
ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 
TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT 
GTGTTATCAC TCATGGTTAT GGCAGCACTG 
AGATGCTTTT CTGTGACTGG TGAGTACTCA 
CGACCGAGTT GCTCTTGCCC GGCGTCAATA 
TTAAAAGTGC TCATCATTGG AAAACGTTCT 



TCATATGCCA AGTACGCCCC CTATTGACGT 444 0 
TGCCCAGTAC ATGAC CTTAT GGGACTTTCC 4500 
CGCTATTACC ATGGTGATGC GGTTTTGGCA 4 560 
CTCACGGGGA TTTCCAAGTC TCCACCCCAT 4 62 0 
AAATCAACGG GACTTTCCAA AATGTCGTAA 468 0 
TAGGCATGTA CGGTGGGAGG TCTATATAAG 4 74 0 
CTGGAGACGC CATCCACGCT GTTTTGACCT 4 8 00 
CCGCGGCCCC AAGCTT CAGC TGCTCGAGCC 4 86 0 
GAATCCTTCT ACTGCCACCA TTTTTTCTCT 4 92 0 
CCCTGTCCAC CAGTCGAGTA TTACGTCATC 4 98 0 
ACCATTTTTT AGGAACGTCA TCGACTTCGA 504 0 
AGGCCAACGT CATCGACTTC GAAGGTTCGA 510 0 
GTCATCGACT TCGAAGGTTC GAATCCTTCG 516 0 
CTTCGAAGGT TCGAATCCTT CTTGCTTCAC 52 2 0 
GTTCGAATCC TTCTGCTGTC ACCAGTCGAG 5280 
CCTTCACCGG TCACCATTTT TTTATAACGT 534 0 
TTACACCAGT CGAGGTACAC GTCATCGACT 540 0 
TTTTTTGTGC ACGTCATCGA CTTCGAAGGT 54 60 
ATGCCTGCAG GTCGAGGTCG ATACCGTCGA 5520 
AGCAATACAG CAGCTACCAA TGCTGATTGT 558 0 
GTGGGTTTTC CAGTCACACC TCAGGTACCT 564 0 
GATCTTAGCC ACTTTTTAAA AGAAAAGGGG 5700 
AGACAAGATA TCCTTGATCT GTGGATCTAC 5 76 0 
AACTACACAC CAGGGCCAGG GATCAGATAT 582 0 
GTACCAGTTG AGCAAGAGAA GGTAGAAGAA 588 0 
CACCCTGTGA GCCTGCATGG GATGGATGAC 5 940 
GACAGCCGCC TAGCATTTCA TCACATGGCC 6 00 0 
TGCTGACATC GAG CTTGCTA CAAGGGACTT 6 06 0 
CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC 612 0 
CTGTACTGGG TCTCTCTGGT TAGACCAGAT 618 0 
GAACCCACTG CTTAAGCCTC AATAAAGCTT 624 0 
TCTGTTGTGT GACTCTGGTA ACTAGAGATC 6 3 00 
CTCTAGCAGT CGAGGGGGGG CCCGGTACCC 63 6 0 
GCGCGCTTGG CGTAATCATG GTCATAGCTG 642 0 
ATTCCACACA ACATACGAGC CGG AAG CAT A 64 8 0 
AGCTAACTCA CATTAATTGC GTTGCGCTCA 6 54 0 
TGCCAGCTGC ATTAATGAAT CGGCCAACGC 6 6 00 
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG 666 0 
TCAGCTCACT CAAAGGCGGT AATACGGTTA 6 72 0 
AACATGTGAG C AAAAGGC C A GCAAAAGGCC 678 0 
TTTTTC C ATA GGCTCCGCCC CCCTGACGAG 6 84 0 
TGGCGAAACC CGACAGGACT ATAAAGATAC 6900 
CGCTCTCCTG TTCCGACCCT GCCGCTTACC 6 96 0 
AGCGTGGCGC TTTCTCATAG CTCACGCTGT 7 02 0 
TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 7 08 0 
AACTATCGTC TTGAGTCCAA CCCGGTAAGA 714 0 
GGTAACAGGA TTAGCAGAGC GAGGTATGTA 72 00 
CCTAACTACG GCTACACTAG AAGGACAGTA 72 6 0 
ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA 73 2 0 
GGTTTTTTTG TTTGCAAGCA GCAGATTACG 73 8 0 
TTGATCTTTT CTACGGGGTC TGACGCTCAG 744 0 
GTCATGAGAT TATCAAAAAG GATCTTCACC 7 50 0 
AAATCAATCT AAAGTATATA TGAGTAAACT 7 56 0 
GAGGCACCTA TCTCAGCGAT CTGTCTATTT 762 0 
GTGTAGATAA CTACGATACG GGAGGGCTTA 76 8 0 
CGAGACCCAC GCTCACCGGC TCCAGATTTA 774 0 
GAGCGCAGAA GTGGTCCTGC AACTTTATCC 78 0 0 
GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT 7 86 0 
GGCATCGTGG TGTCACGCTC GTCGTTTGGT 7 92 0 
T CAAGGCG AG TTACATGATC CCCCATGTTG 7 980 
CCGATCGTTG TCAGAAGTAA GTTGGCCGCA 8 04 0 
CATAATTCTC TTACTGTCAT GCCATCCGTA 8100 
ACCAAGTCAT TCTGAGAATA GTGTATGCGG 8160 
CGGGATAATA CCGCGCCACA TAGCAGAACT 8220 
TCGGGGCGAA AACTCTCAAG GATCTTACCG 82 8 0 
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CTGTTGAGAT CCAGTTCGAT GTAACCCACT 
ACTTTCACCA GCGTTTCTGG GTGAGCAAAA 
ATAAGGGCGA CACGGAAATG TTGAATACTC 
ATTTATCAGG GTTATTGTCT CATGAGCGGA 
CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 



CGTGCACCCA ACTGATCTTC AGCATCTTTT 8 34 0 

ACAGGAAGGC AAAATGCCGC AAAAAAGGGA 840 0 

ATACTCTTCC TTTTTCAATA TTATTGAAGC 84 6 0 

TACATATTTG AATGTATTTA GAAAAATAAA 852 0 

AAAGTGCCAC 856 0 



SEQ. I.D. NO. 12 - pSYNGP2 - codon optimised HIV-1 gagpoi with leader sequence 

1 GGGTCTCTCT GG TT AG AC C A GATCTGAGCC TGGGAGCTCT CTGGCTAACT AGGGAACCCA 

61 CTG CTTAAGC CTCAATAAAG CTTGCCTTGA GTGCTTCAAG TAGTGTGTGC CCGTCTGTTG 

121 TGTGACTCTG ' GTAACTAGAG ATCCCTCAGA CCCTTTTAGT CAGTGTGGAA AATCTCTAGC 

181 AGTGGCGCCC GAACAGGGAC CTGAAAGCGA AAGGGAAACC AGAGCTCTCT CGACGCAGGA 

241 CTCGGCTTGC TGAAGCGCCC GCACGGCAAG AGGCGAGGGG CGGCGACTGG TGAGTACGCC 

3 01 AAAAATTTTG ACTAGCGGAG GCTAGAAGGA GAGAGATGGG CGCCCGCGCC AGCGTGCTGT 

361 CGGGCGGCGA GCTGGACCGC TGGGAGAAGA TCCGCCTGCG CCCCGGCGGC AAAAAGAAGT 

421 ACAAGCTGAA GCACATCGTG TGGGCCAGCC GCGAACTGGA GCGCTTCGCC GTGAACCCCG 

481 GGCTCCTGGA GACCAGCGAG GGGTGCCGCC AGATCCTCGG CCAACTGCAG CCCAGCCTGC 

541 AAACCGGCAG CGAGGAGCTG CGGAGCCTGT ACAACACCGT GGCCACGCTG TACTGCGTCC 

601 ACCAGCGCAT CGAAATCAAG GATACGAAAG AGGCCCTGGA TAAAATCGAA GAGGAACAGA 

661 ATAAGAGCAA AAAGAAGGCC CAACAGGCCG CCGCGGACAC CGGACACAGC AACCAGGTCA 

721 GCCAGAACTA CCCCATCGTG CAGAACATCC AGGGGCAGAT GGTGCAC CAG GCCATCTCCC 

781 CCCGCACGCT GAACGCCTGG GTGAAGGTGG TGGAAGAGAA GGCTTTTAGC CCGGAGGTGA 

841 TACCCATGTT CTCAGCCCTG T CAGAGGG AG CCACCCCCCA AGATCTGAAC ACCATGCTCA 

901 ACACAGTGGG GGGACACCAG GCCGCCATGC AG ATG CTGAA GGAGACCATC AATGAGGAGG 

961 CTGCCGAATG GG AT CGTGTG CATCCGGTGC ACGCAGGGCC CATCGCACCG GGCCAGATGC 

1021 GTGAGCCACG GGGCTCAGAC ATCGCCGGAA CGACTAGTAC CCTTCAGGAA -CAGATCGGCT 

1081 GGATGACCAA CAACCCACCC ATCCCGGTGG GAGAAATCTA CAAACGCTGG ATCATCCTGG 

1141 GCCTGAACAA GATCGTGCGC ATGTATAGCC CTACCAGCAT CCTGGACATC CGCCAAGGCC 

12 01 CGAAGGAACC CTTTCGCGAC TACGTGGACC GGTTCTACAA AACGCTCCGC GCCGAGCAGG 
1261 CTAGCCAGGA GGTGAAGAAC TGGATGACCG AAACCCTGCT GGTCCAGAAC GCGAACCCGG 

13 21 ACTGCAAGAC GATCCTGAAG GCCCTGGGCC CAGCGGCTAC CCTAGAGGAA ATGATGACCG 
13 81 CCTGTCAGGG AGTGGGCGGA CCCGGCCACA AGGCACGCGT CCTGGCTGAG GCCATGAGCC 
1441 AGGTGACCAA CTCCGCTACC ATCATGATGC AGCGCGGCAA CTTTCGGAAC CAACGCAAGA. 
1501 TCGTCAAGTG CTTCAACTGT GGCAAAGAAG GG CACAC AG C CCGCAACTGC AGGGCCCCTA 
1561 GGAAAAAGGG CTGTTGGAAA TGTGGAAAGG AAGGACACCA AATGAAAGAT TGTACTGAGA 
1621 GACAGGCTAA TTTTTTAGGG AAGATCTGGC CTTCCCACAA GGGAAGGCCA GGGAATTTTC 
1681 TT CAG AG CAG ACCAGAGCCA ACAGCCCCAC CAGAAGAGAG CTTCAGGTTT GGGGAAGAGA 
1741 CAACAACTCC CTCTCAGAAG CAGGAGCCGA TAGACAAGGA ACTGTATCCT TTAGCTTCCC 
18 01 TCAGATCACT CTTTGGCAGC GACCCCTCGT CACAATAAAG ATAGGGGGGC AGCTCAAGGA 
1861 GGCTCTCCTG GACAC CGGAG CAGACGACAC CGTGCTGGAG GAGATGTCGT TGCCAGGCCG 
1921 CTGGAAG C C G AAGATGATCG GGGGAATCGG CGGTTTCATC AAGGTGCGCC AG TATGACC A 
1981 GATCCTCATC GAAATCTGCG GCCACAAGGC TATCGGTACC GTGCTGGTGG GCCCCACACC 
2 041 CGTCAACATC ATCGGACGCA AC CTGTTG AC GCAGATCGGT TGCACGCTGA ACTTCCCCAT 
2101 TAGCCCTATC GAGACGGTAC CGGTGAAGCT ' GAAGCCCGGG ATGGACGGCC CGAAGGTCAA 
2161 GCAATGG C CA TTGACAGAGG AGAAGATCAA GGCACTGGTG GAGATTTGCA CAGAGATGGA 
2221 AAAGGAAGGG AAAATCTCCA AGATTGGGCC TGAGAACCCG TACAACACGC CGGTGTTCGC 
22 81 AATCAAGAAG AAGG ACT CG A CGAAATGGCG CAAGCTGGTG GACTTCCGCG AGCTGAACAA 
2 341 GCGCACGCAA GACTTCTGGG AGGTTCAGCT GGGCATCCCG CACCCCGCAG GGCTGAAGAA 
24 01 GAAGAAATCC GTGACCGTAC TGGATGTGGG TGATGCCTAC TTCTCCGTTC CCCTGGACGA 
2461 AGACTTCAGG AAGTACACTG CCTTCACAAT CCCTTCGATC AACAACGAGA CACCGGGGAT 
2521 TCGATATCAG TACAACGTGC TGCCCCAGGG CTGGAAAGGC TCTCCCGCAA TCTTCCAGAG 
2581 TAG C ATG AC C AAAATC CTGG AGCCTTTCCG CAAACAGAAC CCCGACATCG TCATCTATCA 
2641 GTACATGGAT GACTTGTACG TGGGCTCTGA TCTAGAGATA GGGCAGCACC GCACCAAGAT 

27 01 CGAGGAGCTG CGCCAGCACC TGTTGAGGTG GGGACTG AC C ACACCCGACA AGAAGCACCA 
2 761 GAAGGAGCCT CCCTTCCTCT G G ATGGGTT A CGAGCTGCAC C CTG ACAAAT GGACCGTGCA 
2821 GCCTATCGTG CTGCCAGAGA AAGACAGCTG GACTGTCAAC GACATACAGA AG CTGGTGGG 

28 81 GAAGTTGAAC TGGGCCAGTC AG ATTTACC C AGGGATTAAG GTGAGGCAGC TGTGCAAACT 

2 941 CCTCCGCGGA ACCAAGGCAC TCACAGAGGT GATCCCCCTA ACCGAGGAGG CCGAGCTCGA 

3 001 ACTGGCAGAA AACCGAGAGA TCCTAAAGGA GCCCGTGCAC GGCGTGTACT ATGACCCCTC 
3 061 CAAGGACCTG ATCGCCGAGA TC CAGAAGC A GGGGCAAGGC CAGTGGACCT ATCAGATTTA 
3121 CCAGGAGCCC TTCAAGAACC TGAAGACCGG CAAGTACGCC CGGATGAGGG GTGCCCACAC 
3181 TAACGACGTC AAGCAGCTGA CCGAGGCCGT GCAGAAGATC ACCACCGAAA G CAT CGTG AT 
3 241 CTGGGGAAAG ACTCCTAAGT TCAAGCTGCC CATC CAG AAG GAAACCTGGG AAAC CTGGTG 
3 3 01 GACAGAGTAT TGGCAGGCCA CCTGGATTCC TGAGTGGGAG TTCGTCAACA CCCCTCCCCT 
3 361 GGTGAAGCTG TGGTACCAGC TGGAGAAGGA G CC CAT AGTG GGCGCCGAAA CCTTCTACGT 
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3 421 GGATGGGGCC GCTAACAGGG AGACTAAGCT GGGCAAAGCC GGATACGTCA CTAACCGGGG 

34 81 CAGACAGAAG GTTGTCACCC TCACTGACAC CACCAACCAG AAGACTGAGC TGCAGGCCAT 

3 541 TTACCTCGCT TTGCAGGACT CGGGCCTGGA GGTGAACATC GTGACAGACT CTCAGTATGC 

3 6 01 CCTGGGCATC ATTCAAGCCC AGCCAGACCA GAGTGAGTCC GAGCTGGTCA ATCAGATCAT 

3 661 CGAGCAGCTG ATCAAGAAGG AAAAGGTCTA TCTGGCCTGG GTACCCGCCC AC AAAGG CAT 

3 721 TGGCGGCAAT GAG C AGGTCG ACAAGCTGGT CTCGGCTGGC ATCAGGAAGG TGCTATTCCT 

3 781 GGATGGCATC GACAAGGCCC AGGACGAGCA CGAGAAATAC CACAGCAACT GGCGGGCCAT 

3 841 GGCTAGCGAC TTCAACCTGC CCCCTGTGGT GGCCAAAGAG ATCGTGGCCA GCTGTGACAA 

3 901 GTGTCAGCTC AAGGGCGAAG CCATGCATGG CCAGGTGGAC TGTAGCCCCG GCATCTGGCA 

3 961 ACTCGATTGC ACCCATCTGG AGGGCAAGGT TATCCTGGTA GCCGTCCATG TGGCCAGTGG 

4 021 CTACATCGAG GCCGAGGTCA TTCCCGCCGA AACAGGGCAG GAGACAGCCT ACTTCCTCCT 
4 081 GAAGCTGGCA GGCCGGTGGC CAGTGAAGAC CATCCATACT GACAATGGCA GCAATTTCAC 
4141. CAGTGCTACG GTTAAGGCCG CCTGCTGGTG GGCGGGAATC AAGCAGGAGT TCGGGATCCC 
42 01 CTACAATCCC CAGAGTCAGG GCGTCGTCGA GTCTATGAAT AAGGAGTTAA AGAAGATTAT 
4261 CGGCCAGGTC AGAGATCAGG CTGAGCATCT CAAGACCGCG GTCCAAATGG CGGTATTCAT 
4321 CCACAATTTC AAGCGGAAGG GGGGGATTGG GGGGTACAGT GCGGGGGAGC GGATCGTGGA 
4381 CATCATCGCG ACCGACATCC AGACTAAGGA GCTGCAAAAG CAGATTACCA AGATTCAGAA 
4441 TTTCCGGGTC TACTACAGGG ACAGCAGAAA TCCCCTCTGG AAAGGCCCAG CGAAGCTCCT 
4 501 CTGGAAGGGT GAGGGGGCAG TAGTGATCCA GGATAATAGC GACATCAAGG TGGTGCCCAG 
4 561 AAGAAAGGCG AAGATCATTA GGGATTATGG CAAACAGATG GCGGGTGATG ATTGCGTGGC 
4 621 GAG C AG AC AG GATGAGGATT AG 



SEQ. I.D. NO. 13 - pSYNGP3 - codon optimised HTV-1 gagpol with leader sequence from 
the major splice donor 

1 GTGAGTACGC CAAAAATTTT GACTAGCGGA GGCTAGAAGG AGAGAGATGG GCGCCCGCGC 
61 CAGCGTGCTG TCGGGCGGCG AGCTGGACCG CTGGGAGAAG ATCCGCCTGC GCCCCGGCGG 
121 CAAAAAGAAG TACAAGCTGA AGCACATCGT GTGGGC CAGC CGCGAACTGG AGCGCTTCGC 
181 CGTGAACCCC GGGCTCCTGG AGACCAGCGA GGGGTGCCGC CAGATCCTCG GCCAACTGCA 
241 GCCCAGCCTG CAAACCGGCA GCGAGGAGCT GCGCAGCCTG TACAACACCG TGGCCACGCT 
3 01 GTACTGCGTC CACCAGCGCA TCGAAATCAA GGATACGAAA GAGGCCCTGG ATAAAATCGA 
3 61 AGAGGAACAG AATAAGAGCA AAAAGAAGGC CCAACAGGCC GCCGCGGACA C CGGACAC AG 
421 CAACCAGGTC AGCCAGAACT ACCCCATCGT GCAGAACATC CAGGGGCAGA TGGTGCACCA 
481 GGCCATCTCC CCCCGCACGC TGAACGCCTG GGTGAAGGTG GTGGAAGAGA AGGCTTTTAG 
541 CCCGGAGGTG ATACCCATGT TCTCAGCCCT GTCAGAGGGA GCCACCCCCC AAGAT CTGAA 
6 01 CACCATGCTC AACACAGTGG GGGGACACCA GGCCGCCATG CAGATGCTGA AGGAGACCAT 
661 CAATGAGGAG GCTGCCGAAT GGGATCGTGT GCATCCGGTG CACGCAGGGC CCATCGCACC 
721 GGGCCAGATG CGTGAGCCAC GGGG CTCAGA CATCGCCGGA ACGACTAGTA CCCTTCAGGA 
781 ACAGATCGGC TGGATGACCA ACAACCCACC CATCCCGGTG GGAGAAATCT ACAAACGCTG 
8 41 GATCATCCTG GGCCTGAACA AGATCGTGCG CATGTATAGC CCTACCAGCA TCCTGGACAT 
901 CCGCCAAGGC CCGAAGGAAC CCTTTCGCGA CTACGTGGAC CGGTTCTACA AAACGCTCCG 
961 CGCCGAGCAG GCTAGCCAGG AGGTGAAGAA CTGGATGACC GAAACCCTGC TGGTCCAGAA 
1021 CGCGAACCCG GACTGCAAGA CGATC CTGAA GGCCCTGGGC C CAGCGGCT A CCCTAGAGGA 
1081 AATGATG AC C GCCTGTCAGG GAGTGGGCGG ACCCGGCCAC AAGGCACGCG TCCTGGCTGA 
1141 GGCCATGAGC CAGGTGACCA ACTCCGCTAC CATCATGATG CAGCGCGGCA ACTTTCGGAA 
1201 CCAACGCAAG ATCGTCAAGT GCTTCAACTG TGGCAAAGAA GGGCACACAG CCCGCAACTG 
1261 CAGGGCCCCT AGGAAAAAGG GCTGTTGGAA ATGTGGAAAG GAAGGACACC AAATGAAAGA 
13 21 TTGTACTGAG AGACAGGCTA ATTTTTTAGG GAAGATCTGG CCTTCCCACA AGGGAAGGCC 
13 81 AGGGAATTTT CTTCAGAGCA GACCAG AG CC AACAGCCCCA C CAGAAG AG A GCTTCAGGTT 
1441 TGGGGAAGAG ACAACAACTC CCTCTCAGAA GCAGGAGCCG ATAGACAAGG AACTGTATCC 
1501 TTTAGCTTCC CTC AG AT C AC TCTTTGGCAG CGACCCCTCG TCACAATAAA GATAGGGGGG 

15 61 CAGCTCAAGG AGGCTCTCCT GGACACCGGA GCAGACGACA CCGTGCTGGA GGAGATGTCG 
1621 TTGCCAGGCC GCTGGAAGCC GAAGATGATC GGGGGAATCG GCGGTTTCAT CAAGGTGCGC 

16 81 CAGTATGACC AGATCCTCAT CGAAATCTGC GGCCACAAGG CTATCGGTAC CGTG CTGGTG 
1741 GGCCCCACAC CCGTCAACAT CATCGGACGC AACCTGTTGA CGCAGATCGG TTGCACGCTG 
18 01 AACTTCCCCA TTAGCCCTAT CGAGACGGTA CCGGTGAAGC TGAAGCCCGG GATGGACGGC 
1861 CCGAAGGTCA AG CAATGG C C ATTGACAGAG GAGAAGATCA AGGCACTGGT GGAGATTTGC 
1921 ACAGAGATGG AAAAGGAAGG GAAAATCTCC AAGATTGGGC CTGAGAACCC GTACAACACG 
1981 CCGGTGTTCG CAATCAAGAA GAAGGACTCG ACGAAATGGC GCAAGCTGGT GGACTTCCGC 
2 041 GAGCTGAACA AGCGCACGCA AGACTTCTGG GAGGTTCAGC TGGGCATCCC GCACCCCGCA 
2101 GGGCTGAAGA AG AAGAAAT C CGTGACCGTA CTGGATGTGG GTGATGCCTA CTTCTCCGTT 
2161 CCCCTGGACG AAGACTTCAG GAAGTACACT GCCTTCACAA TCCCTTCGAT CAACAACGAG 
2221 ACACCGGGGA TTCGATATCA GTACAACGTG CTGCCCCAGG GCTGGAAAGG CTCTCCCGCA 
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22 81 ATCTTCCAGA GTAGCATGAC CAAAATCCTG 
2 341 . GTCATCTATC AGTACATGGA TGACTTGTAC 
24 01 CGCACCAAGA TCGAGGAGCT GCGCCAGCAC 
24 61 AAGAAGC AC C AGAAGGAGCC TCCCTTCCTC 
2 521 TGGACCGTGC AGCCTATCGT GCTGCCAGAG 
2 581 AAGCTGGTGG GGAAGTTGAA CTGGGCCAGT 
2 6 41 CTGTGCAAAC TCCTCCGCGG AACCAAGGCA 
.2 7 01 GCCGAGCTCG AACTGGCAGA AAAC CGAGAG 
2 761 TATGACCCCT CCAAGGACCT GATCGCCGAG 
2 821 TATCAGATTT ACCAGGAGCC CTTCAAGAAC 
28 81 GGTGCCCACA CTAACGACGT CAAGCAGCTG 

2 941 AGCATCGTGA TCTGGGGAAA GACTCCTAAG 

3 001 GAAACCTGGT GGACAGAGTA TTGGCAGGCC 
3 061 ACCCCTCCCC TGGTGAAGCT GTGGTACCAG 
3121 ACCTTCTACG TGGATGGGGC CGCTAACAGG 
3181 ACTAACCGGG GCAGACAGAA GGTTGTCACC 
3 241 CTGCAGGCCA TTTACCTCGC TTTGCAGGAC 

33 01 TCTCAGTATG CCCTGGGCAT CATTCAAGCC 
3 3 61 AATCAGATCA TCGAGCAGCT GATCAAGAAG 
3421 CACAAAGGCA TTGGCGGCAA TGAGCAGGTC 

34 81 GTGCTATTCC TGGATGGCAT CGACAAGGCC 
3 541 TGGCGGGCCA TGGCTAGCGA CTTCAACCTG 
3 6 01 AG CTGTGAC A AGTGTCAGCT CAAGGGCGAA 
36 61 GGCATCTGGC AACTCGATTG CACCCATCTG 
3 721 GTGGCCAGTG GCTACATCGA GGCCGAGGTC 
3 7 81 TACTTCCTCC TGAAGCTGGC AGGCCGGTGG 
3 841 AGCAATTT C A CCAGTGCTAC GGTTAAGGCC 
3 901 TTCGGGATCC CCTACAATCC CCAGAGTCAG 

3 961 AAGAAGATTA TCGGCCAGGT CAGAGATCAG 

4 021 GCGGTATTCA TCCACAATTT CAAGCGGAAG 
4081 CGGATCGTGG ACATCATCGC GACCGACATC 
4141 AAGATTCAGA ATTTCCGGGT CTACTACAGG 
42 01 GCGAAGCTCC TCTGGAAGGG TGAGGGGGCA 

42 61 GTGGTGCCCA GAAGAAAGGC GAAGATCATT 

43 21 GATTGCGTGG CGAGCAGACA GGATGAGGAT 



GAGCCTTTCC GCAAACAGAA CCCCGACATC 
GTGGGCTCTG ATCTAGAGAT AGGGCAGCAC 
CTGTTGAGGT GGGGACTGAC CACACCCGAC 
TGGATGGGTT ACGAGCTGCA CCCTGACAAA 
AAAG AC AG C T GGACTGTCAA CGACATACAG 
C AG ATTTAC C CAGGGATTAA GGTGAGGCAG 
CTCACAGAGG TGATCCCCCT AACCGAGGAG 
AT C CT AAAG G AGCCCGTGCA CGGCGTGTAC 
ATCCAGAAGC AGGGGCAAGG CCAGTGGACC 
CTGAAGACCG GCAAGTACGC CCGGATGAGG 
ACCGAGGCCG TGCAGAAGAT CACCACCGAA 
TTCAAGCTGC C CATCCAGAA GGAAACCTGG 
ACCTGGATTC CTGAGTGGGA GTTCGTCAAC 
CTGGAGAAGG AGCCCATAGT GGGCGCCGAA 
GAGACTAAGC TGGGCAAAGC CGGATACGTC 
CTCACTGACA CCACCAACCA GAAGACTGAG 
TCGGGCCTGG AGGTGAACAT CGTGACAGAC 
CAGCCAGACC AGAGTGAGTC CGAGCTGGTC 
GAAAAGGTCT ATCTGGCCTG GGTACCCGCC 
GACAAGCTGG TCTCGGCTGG CAT C AGG AAG 
CAGGACGAGC ACGAGAAATA CCACAGCAAC 
CCCCCTGTGG TGGC CAAAGA GATCGTGGCC 
GCCATG CATG GCCAGGTGGA CTGTAGCCCC 
GAGGGCAAGG TTATCCTGGT AGCCGTCCAT 
ATTCCCGCCG AAACAGGGCA GGAGACAGCC 
CCAGTGAAGA CCATCCATAC TGACAATGGC 
GCCTGCTGGT GGGCGGGAAT CAAGCAGGAG 
GGCGTCGTCG AGT CTATGAA TAAGGAGTTA 
GCTGAGCATC T C AAG AC CG C GGTCCAAATG 
GGGGGGATTG GGGGGTACAG TGCGGGGGAG 
CAGACTAAGG AGCTGCAAAA GCAGATTACC 
GACAGCAGAA ATCCCCTCTG GAAAGGCCCA 
GTAGTGATCC AGGATAATAG CGACATCAAG 
AGGGATTATG GCAAACAGAT GGCGGGT GAT 
TAG 



SEQ. LD. NO. 14 - pSYNGP4 - codon optimised HTV-1 gagpol with 20 bp of the leader 
sequence of HIV-1 , upstream of the start codon of ATG. 

1 CGGAGGCTAG AAGGAGAGAG ATGGGCGCCC GCGCCAGCGT GCTGTCGGGC GGCGAGCTGG 
61 ACCGCTGGGA GAAGATCCGC CTGCGCCCCG GCGGCAAAAA GAAGTACAAG CTGAAGCACA 
121 TCGTGTGGGC CAGCCGCGAA CTGGAGCGCT TCGCCGTGAA CCCCGGGCTC CTGG AG AC C A 
181 GCGAGGGGTG CCGCCAGATC CTCGGCCAAC TGCAGCCCAG CCTGCAAACC GGCAGCGAGG 
241 AGCTGCGCAG CCTGTACAAC ACCGTGGCCA CGCTGTACTG CGTCCACCAG CGCATCGAAA 
301 TCAAGGATAC GAAAGAGGCC CTGGATAAAA TCGAAGAGGA ACAGAATAAG AGCAAAAAGA 
361 AGGCCCAACA GGCCGCCGCG GACACCGGAC ACAGCAACCA GGTCAGC CAG AACTACCCCA 
421 TCGTGCAGAA CATCCAGGGG CAGATGGTGC ACCAGGCCAT CTCCCCCCGC ACGCTGAACG 
481 CCTGGGTGAA GGTGGTGGAA GAGAAGGCTT TTAGCCCGGA GGTGATACCC ATGTTCTCAG 
541 CCCTGTCAGA GGGAGCCACC CCCCAAGATC TGAACACCAT GCTCAACACA GTGGGGGGAC 
601 ACCAGGCCGC CATGCAGATG CTGAAGGAGA CCATCAATGA GGAGGCTGCC GAATGGGATC 
661 GTGTGCATCC GGTGCACGCA GGGCCCATCG CACCGGGCCA GATGCGTGAG CCACGGGGCT 
721 CAGACATCGC CGGAACGACT AGTACCCTTC AGGAACAGAT CGGCTGGATG ACCAACAACC 
781 CACCCATCCC GGTGGGAGAA ATCTACAAAC GCTGGATCAT CCTGGGCCTG AACAAGATCG 
841 TGCGCATGTA TAGCCCTACC AGCATCCTGG ACATCCGCCA AGGCCCGAAG GAACCCTTTC 
901 GCGACTACGT GGAC CGGTTC TACAAAACGC TCCGCGCCGA GCAGGCTAGC CAGGAGGTGA 
961 AGAACTGGAT G AC CGAAACC CTGCTGGTCC AGAACGCGAA CCCGGACTGC AAGACGATCC 
1021 TGAAGGCCCT GGGC CCAGCG GCTACCCTAG AGGAAATGAT GACCGCCTGT CAGGGAGTGG 
10 81 GCGGACCCGG CCACAAGGCA CGCGTCCTGG CTGAGGCCAT GAGCCAGGTG ACCAACTCCG 
1141 CTAC CAT CAT GATGCAGCGC GGCAACTTTC GGAACCAACG CAAGATCGTC AAGTGCTTCA 
12 01 ACTGTGGCAA AGAAGGGCAC ACAGCCCGCA ACTGCAGGGC CCCTAGGAAA AAGGGCTGTT 

12 61 GGAAATGTGG AAAGGAAGGA C AC CAAATG A AAGATTGTAC TGAGAGACAG GCTAATTTTT 

13 21 TAGGGAAGAT CTGGCCTTCC CACAAGGGAA GGCCAGGGAA TTTTCTTCAG AGCAGAC CAG 
13 81 AGCCAACAGC CCCACCAGAA GAG AG CTTC A GGTTTGGGGA AGAGACAACA ACTCCCTCTC 
1441 AGAAGCAGGA GCCGATAGAC AAGGAACTGT ATCCTTTAGC TTCCCTCAGA TCACTCTTTG 
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1501 GCAGCGACCC CTCGTCACAA TAAAGATAGG GGGGCAGCTC AAGGAGGCTC TCCTGGACAC 
1561 CG GAG C AG AC GACACCGTGC TGGAGGAGAT GTCGTTGCCA GGCCGCTGGA AGCCGAAGAT 
1621 GAT CGGGGG A ATCGGCGGTT TCATCAAGGT GCGCCAGTAT GACCAGATCC TCATCGAAAT 
16 81 CTGCGGCCAC AAGGCTATCG GTACCGTGCT GGTGGGCCCC ACACCCGTCA ACATCATCGG 
1741 ACGCAACCTG TTGACGCAGA TCGGTTGCAC GCTGAACTTC CCCATTAGCC CTATCGAGAC 
18 01 GGTAC CGGTG AAGCTGAAGC CCGGGATGGA CGGCGCGAAG GTCAAGCAAT GGCCATTGAC 
1861 AGAGGAGAAG ATCAAGGCAC TGGTGGAGAT TTGCACAGAG ATGGAAAAGG AAGGGAAAAT 
1921 CTCCAAGATT GGGCCTGAGA ACCCGTACAA CACGCCGGTG TTCGCAATCA AGAAGAAGGA 
1981 CTCGACGAAA TGGCGCAAGC TGGTGGACTT CCGCGAGCTG AACAAGCGCA CGCAAGACTT 
2 041 CTGGGAGGTT CAGCTGGGCA TCCCGCACCC CGCAGGGCTG AAGAAGAAGA AATCCGTGAC 
2101 CGTACTGGAT GTGGGTGATG CCTACTTCTC CGTTCCCCTG GACGAAGACT TCAGGAAGTA 
2161 CACTGCCTTC ACAATCCCTT CGATCAACAA CGAGACACCG GGGATTCGAT ATCAGTAGAA 
2221 .CGTGCTGCCC CAGGGCTGGA AAGGCTCTCC CGCAATCTTC CAGAGTAGCA TGACCAAAAT 
2281 CCTGGAGCCT TTCCGCAAAC AGAAC CCCG A CATCGTCATC TATCAGTACA TGGATGACTT 
2341 GTACGTGGGC TCTGATCTAG AGATAGGG C A GCACCGCACC AAGATCGAGG AGCTGCGCCA 
24 01 GCACCTGTTG AGGTGGGGAC TGACCACACC CGACAAGAAG CACCAGAAGG AGCCTCCCTT 
2 461 CCTCTGGATG G GTTACG AGC TGCACCCTGA CAAATGGACC GTGCAGCCTA TCGTGCTGCC 
2521 AGAGAAAGAC AGCTGGACTG TCAACGACAT ACAGAAGCTG GTGGGGAAGT TGAACTGGGC 
2 581 CAGTGAGATT TACCCAGGGA TTAAGGTGAG GCAGCTGTGC AAACTCCTCC GCGGAACCAA 
2 641 GGCACTCACA GAGGTGATCC CCCTAACCGA GGAGGCCGAG CTCGAACTGG CAGAAAACCG 
2 701 AGAGATCCTA AAGGAGCCCG TGCACGGCGT GTACTATGAC CCCTCCAAGG ACCTGATCGC 
2 761 CGAGATCCAG AAGCAGGGGC AAGGCCAGTG GACCTATCAG ATTTACCAGG AGCCCTTCAA 
2 821 GAACCTGAAG ACCGG CAAGT ACGCCCGGAT GAGGGGTGCC CACACTAACG ACGTCAAGCA 
2 881 GCTGACCGAG GCCGTGCAGA AGATCACCAC CGAAAGCATC GTGATCTGGG GAAAGACTCC 

2 941 TAAGTTCAAG CTGCCCATCC AGAAGGAAAC CTGGGAAACC TGGTGGACAG AGTATTGGCA 

3 001 GGCCACCTGG ATTCCTGAGT GGGAGTTCGT CAACACCCCT CCCCTGGTGA AGCTGTGGTA 
3 061 C C AG CTGGAG AAGGAGCCCA TAGTGGGCGC CGAAACCTTC TACGTGGATG GGG CCGCTAA 
3121 CAGGGAGACT AAGCTGGGCA AAGCCGGATA CGTCACTAAC CGGGGCAGAC AGAAGGTTGT 
3181 CACCCTCACT GACACCACCA ACCAGAAGAC TGAGCTGCAG GCCATTTACC TCGCTTTGCA 
3 241 GGACTCGGGC CTGGAGGTGA ACATCGTGAC AGACTCTCAG TATGCCCTGG GCATCATTCA 
3 3 01 AGCCCAGCCA GACCAGAGTG AGTCCGAGCT GGTCAATCAG ATCATCGAGC AGCTGATCAA 
3 361 GAAGGAAAAG GTCTATCTGG CCTGGGTACC CGCCCACAAA GGCATTGGCG GCAATGAGCA 
3421 GGTCGACAAG CTGGTCTCGG CTGGCATCAG GAAGGTGCTA TTCCTGGATG GCATCGACAA 
3481 GGCCCAGGAC GAGCACGAGA AATACCACAG CAACTGGCGG GCCATGGCTA GCGACTTCAA 
3 541 CCTGCCCCCT GTGGTGGCCA AAGAGATCGT GGCCAGCTGT GACAAGTGTC AG CT CAAGGG 
3 6 01 CGAAGCCATG CATGGCCAGG TGGACTGTAG CCCCGGCATC TGGCAACTCG ATTGCACCCA 
3 661 TCTGGAGGGC AAGGTTATCC TGGTAGCCGT CCATGTGGCC AGTGGCTACA TCGAGGCCGA 
3 721 GGTCATTCCC GCCGAAACAG GGCAGGAGAC AGCCTACTTC CTCCTGAAGC TGGCAGGCCG 
3 781 GTGGCCAGTG AAG AC CATCC ATACTGACAA TGGCAG CAAT TTCACCAGTG CTACGGTTAA 
3 841 GGCCGCCTGC TGGTGGGCGG GAATCAAGCA GGAGTTCGGG ATCCCCTACA ATCC CCAGAG 
3 901 TCAGGGCGTC GTCGAGTCTA TGAATAAGGA GTTAAAGAAG ATTATCGGCC AGGTCAGAGA 

3 961 TCAGGCTGAG CATCTCAAGA CCGCGGTCCA AATGGCGGTA TTCATCCACA ATTTCAAGCG 

4 021 GAAGGGGGGG ATTGGGGGGT ACAGTGCGGG GGAGCGGATC GTGGAC AT C A TCGCGACCGA 
4081 CATCCAGACT AAGGAGCTGC AAAAGCAGAT TACCAAGATT CAGAATTTCC GGGTCTACTA 
4141 CAGGGACAGC AGAAATCCCC TCTGGAAAGG CCCAGCGAAG CTCCTCTGGA AGGGTGAGGG 
42 01 GGCAGTAGTG ATC CAGGAT A AT AG CG ACAT CAAGGTGGTG CCCAGAAGAA AGGCGAAGAT 
4261 CATTAGGGAT TATGGCAAAC AGATGGCGGG TGATGATTGC GTGGCGAGCA GACAGGATGA 
4321 GGATTAG 



